BIOTECHNOLOGY 
SYSTEMS 
BRANCH ..... 



RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical In^mationg 
Center (STIC) detected errors when processing the following computer readat&f 
form: 



Application Serial Number: 




Source: 



Date Processed by STIC: 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: natin21help@uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: natin3heln(a>.uspto.eov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM. ACCESSIBLE THROUGH THE U. S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual- 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3 .0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errorcd sequence 
listings, thus saving time and money. 



Checker Version 3.0 can be down loaded from the IJSPTO website at the following address: 

http://www.uspto.gov/web/offices/pac/checker 



Raw Sequence Listing Error Summary . 

. / 

~ /»nDDPrTinM SERIAL NUMBER:, Qy^l\l> C ^ 

FRROR DETECTED gunttFSTED CORRECTION 

~ -7. DMA** HFADERS WHICH WERE INSERTED BY PTO SOFTWARE 

NEW RULES CASES: PLEASE «^^J^^^l next ,ine. 
Wrapped NuCeics ™<^™?Z^^^*^ 

Please adjust your right margin to .3. as this will prevent trapping . 



1 



Wrapped Aminos 



Incor/ect Line Length 

Misaligned Amino Acid 
Numbering 

Non-ASCII 



Variable Length 



Patentln ver. 2.0 "bug" A 



The amino acid numberAext at the end of each line "wrapped down to the next line. 
This may occur if your file was retrieved in a word processor after create* «. 
Please adjust your right margin to .3. as this will prevent "wrapping . 

The rules require that a line no. exceed 72 characters in length. This includes spaces. 

Sequence(s) contain n's or Xaa's which represented more than one residue. 

As per the rules, each n or Xaa can only represent a single residue. 

pIZ present the maximum number of each residue having variable ieng.h and 

indicate in the (ix) feature section that some may be missing. 

o . ., re-™ ■> n has caused the <220>-<223> section to be missing from amino acid 
„ug" ,n Patentln vers,on 2.0 £ ; » s« Mh. ^ ^ ^ ^ ^ ^ 

sequence(s) • ^ w " no j< ... ^ii. 



Skipped Sequences 
(OLD RULES) 



Skipped Sequences 
(NEW RULES) 



Use of n's" or Xaa's 
(NEW f&L^)' 



sections for Artificial or Unknown sequences. 

Sequence(s) _ missing. If intentional, please use the following forma, for each skipped sequence: 

"* ™ any headings under "SEQUENCE CHARACTERISTICS") 
(xi) SEQUENCE DESCRIPTIONS ID NO:X: 
This sequence is intentionally skipped 

Please also adjus. .he "(iii) NUMBER OF SEQUENCES:" response* include the skipped sequence(s). 
Sequcnce(s) missing. If intentional, p.ease use the fol.owing forma, for each skipped sequence. 
<210> sequence id number 
*400> sequence id number 
000 

Use of n's and/or Xaa's have been detected in the Sequence Listing. 



Use of <213>Organism 
(NEW RULES) 

Use of <220> Feature 
(NEW RULES) 



Patentln ver. 2.0 "bug" 



Sequence(s) . 



are missing this mandatory field or its response. 



Please do no. use "Copy to Disk" (unction of Patentln version 2.0. This causes a corrupted 
- in missing mandate numeric identifiers and responses (as indeed on raw sequence ,s„ng>. 

Instead, please use "File Manager or any other means to copy file to floppy disk. 
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DATE* 03/29/2001 

SS B S°SSciJS" 08/09/673,300 TIMbI 10:28:05 DOeS Not Comply 

Corrected Diskette Needed 

Input Set : A:\ES.txt 

Output Set: N:\CRF3\03292001\I67 3300 . raw 

<110> APPLICANT: SDNTORY LIMITED Drote in having glycosyl transferase 

<120> TITLE OF INVENTION: Gene coding for a protein navmg g y , 

<140> CURRENT 0 APPLI CAT ION NUMBER: US/09/673,300 yx Aji^^> 

<141> CURRENT FILING DATE: 2000-10-16 ^-/^ <~ 

<160> NUMBER OF SEQ ID NOS:/6 J 
ERRORED SEQUENCES h/rf^ 1 ***]^* 




191 <210> SEQ ID NO 

192 <211> LENGTH: 25 

193 <212> TYPE: DNA 

194 <213> ORGANISM: Artificial Sequence 

196 <220> FEATURE: ^ ^ 

197 <223> OTHER INFORMATION: Primer I ^jJuJ( ^ 

E--> 199 <400> SEQUENCE: 3£ ■ ' 25 

200 ataactacat atgggacaac tccac 

628 <210> SEQ ID NO: 14 

629 <211> LENGTH: 23 

630 <212> TYPE: DNA 

631 <213> ORGANISM: Art 

633 <220> FEATURE: 

634 <223> OTHER INFORMATION: Primer 

636 <400> SEQUENCE: 14 23 

637 ctcgtaccat ggaaaactat tct 
E--> 642/2/1 
E--> 643 f 



■tificial Sequence 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/673,300 



DATE: 03/29/2001 
TIME: 10:28:06 



Input Set : A:\ES.txt 

Output Set: N:\CRF3\03292001\l673300.raw 



L:8 M:270 C: Current Application Number differs, Replaced Current Application No 
L:8 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
L:0 M:201 W: Mandatory field data missing, FILE REFERENCE 

L:199 M:212 E: (34) Invalid or duplicate Sequence ID Number, SEQUENCE ID NOS : 3 differs:2 
L:220 M:283 W: Missing Blank Line separator, <400> field identifier 
L:642 M:254 E: No. of Bases conflict, LENGTH : Input : 18 Counted:24 SEQ:14 
L:642 M:320 E: (1) Wrong Nucleic Acid Designator, NUMBER OF INVALID KEYS : 2 
M:254 Repeated in SeqNo=14 

L:643 M:320 E: (1) Wrong Nucleic Acid Designator, NUMBER OF INVALID KEYS : 2 

L^643 M:252 E: No. of Seq. differs, <211>LENGTH : Input : 23 Found:25 SEQ:14 

L:8 M:203 E : No. of Seq. differs, <160> Number Of Sequences : Input (6) Counted (14) 
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